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We study dark matter (DM) production in the mono-Z channel at the 13 TeV LHC both in 
an effective field theory framework as well as in simplified models with vector mediators, using a 
multivariate analysis. For DM-quark effective operators with scalar, vector, and tensor couplings 
and DM mass of 100 GeV, the 5a reach in the DM interaction scale A is around 2, 1, and 3 
TeV, respectively, for 3 ab“^ and assuming a 5% systematic uncertainty on the total background 
normalization. For simplified models with leptophobic vector mediators, the 5a reach for the mass 
of the mediator is 1.7 TeV also assuming a 5% systematics and 3 ab“^ of integrated luminosity. The 
reach for the dark matter interaction scale obtained with the multivariate analysis using a likelihood 
function discriminant is at least twice as high as that obtained from a simple cut and count analysis, 
once systematics on the background normalization larger than a few percent are taken into account. 
Moreover, the reach is much more stable against degradation due these systematic uncertainties. 
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I. INTRODUCTION 


The particle nature of dark matter (DM) is one of the most important unsolved questions in physics, and the 
focus of immense experimental and theoretical endeavors. One of the most promising arenas for such endeavors 
is the study of DM in colliders. 

Approaches to study the dark matter parameter space include effective operators [1] and simplified models 
[5], with a mono-X observational signature, where X may denote monojet [5], mono-photon [3], mono-Z [SJ|5], 
mono-IT [7], mono-Higgs, mono-& [5], or mono-top [5]. In studies with effective operators, limits on the cut-off 
scale A of the low energy DM effective field theory have been reported. 

The purpose of the current paper is to explore the reach at 13 TeV for the effective interaction scale between 
DM and quarks as well as electroweak bosons. Moreover, we also study simplified models with DM-quark inter¬ 
actions mediated by leptophobic vector mediators. We explore the mass reach for such mediators. We choose 
the mono-Z channel for our study, since, compared to monojet, it is expected that background uncertainties 
should scale more favorably for the mono-Z signature. In mono-Z studies, a pair of charged leptons that re¬ 
construct to a boson recoil against the missing transverse momentum carried by dark matter. The dominant 
background in this case is constituted by SM processes ZZ Using ATLAS bounds on ZZ —>• Uiyiy [S], 

constraints were put on the dark matter effective scale A in [^. 

We will be especially careful to incorporate the effects of systematic uncertainties, which are expected to 
become increasingly important in the current run of the LHC. Our strategy is to obtain a set of kinematic 
variables that can discriminate between signal and background, and construct a multivariate likelihood function 
on which cuts can be imposed. Broadly, we find that the reach for the DM interaction scale A obtained with 
the multivariate analysis using a likelihood function is at least twice as high as that obtained from a simple cut 
and count analysis, when systematic uncertainties greater than a few percent are taken into account. Moreover, 
the reach is much more stable against degradation due to systematics in the total background normalization. 

The rest of the paper is structured as follows. In Sect ion [TTl we first discuss both DM effective operators 
and simplified models with vector mediators. In Section |HH we discuss the kinematic variables used in the 
construction of the multivariate discriminant. In Section |IV[ we discuss systematic uncertainties. In Section 
|vj we discuss the construction of the multivariate (MVA) discriminant. In Section VL we present the main 
results of our multivariate study for the DM effective operators, while in Section VH we present the results for 
the Z' model. We end with our conclusions. 


II. BACKGROUND: EFFECTIVE OPERATORS AND VECTOR MEDIATORS 

In this Section, we describe the theoretical background of our study. We first discuss DM effective operators, 
and then turn to simplified models with vector mediators. 


A. Effective Operators 

Effective field theories for dark matter interacting primarily with SM quarks have been considered in Refs. [U 

laiinHiTi. 

We consider the following interactions 

^9 I XX + X7m75X 

+ 7^97^9 X7mX + (1) 

In the above, x denotes a Dirac fermion that we assume to be the DM particle. The SM quarks are denoted by 
q. Dl, D5, D8, and D9 denote effective DM-g operators with scalar, vector, axial-vector, and tensor couplings. 
To simplify the analysis, we will assume that only one type of operator is dominant at one time, with the 
others decoupled. For the scalar operator Dl we assume an underlying model with a scalar mediator S with 
democratic couplings to all quark flavors. A Higgs portal model, on the other hand, leads to quark mass 
suppressed couplings which cannot be probed by the LHC. 
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We also consider two operators considered in 

Ls = ^xx{D^.H)^D^^H (2) 

which arises at dimension 5, and 

( 3 ) 

i 

which arises at dimension 7. Here, Fi with j = 1, 2, 3 are the held strengths for the SM gauge groups. 

In [B], the CLs method was used to put limits on the cross section for new physics (and hence the scales A 
for the different operators) based on existing ATLAS results. An upper limit on the number of events coming 
from new physics was obtained from the expected number of background events with uncertainties. The 90% 
CL exclusion bounds on the scales A were given with 7 TeV data, and it was found that for some types of 
operators, scales of the order of 100 GeV - 1 TeV could be probed. 


B. Vector Mediators 


We now turn to simplihed models with a leptophobic vector mediator. The Lagrangian is 


£ = - 


9 

2cw 


5/9i7^(l - 75 ) 9 *^^ + 9xXl^{^ - 75)x 




(4) 


where the qi are SM quarks, gj and parametrize the coupling between quarks and the vector mediator Z', 
and between the DM candidate y and the Z', respectively, in terms of the SM coupling 2^1 where g is the 
SU{2)l coupling and cw is the cosine of the Weinberg angle. In our simulations we choose gf = g^ = I and 
= 100 GeV as a benchmark point. 

For the benchmark point, the branching ratios of a 500 GeV Z' to dark matter, jets and top quark pairs are 
5.2%, 80.2% and 14.7%, respectively, and remain unchanged for larger masses. The ratio of the total width to 
mz' is less than 0.1 for masses up to 4 TeV. 

Collider studies of the Z' model have been performed by many authors in recent years, starting from the work 
of [14j . Bounds from direct and indirect dark matter detection and from colliders for a leptophobic Z' portal 
model were derived in [15]. Overall, for couplings of SM size and a 100 GeV DM, strong bounds from LUX 
and dijet searches at the LHC 8 TeV exclude a Z' portal up to mz' ~ 2.8 TeV. However, these bounds drop 
to mz> ~ 2 TeV for a 50% suppressed couplings to quarks or heavier DM. On the other hand, the branching 
ratio of the Z' to DM is mildly dependent on until it gets very close to the decay threshold, but increases 
very fast as the couplings between Z' and quarks get smaller. As a result, for a 500 GeV dark matter and 50% 
suppressed couplings to quarks, an 1 TeV Z', for example, is not excluded by any experiment and is able to 
reproduce the observed DM relic density [T 8 ] . 


III. KINEMATIC VARIABLES FOR MULTIVARIATE ANALYSIS 

In this Section, we describe the kinematic variables that will be used in our multivariate analysis. In the 
construction of a discriminant with a higher performance level than simple event counting, at least in certain 
regions of phase space, the information from the shape of the kinematic distributions is crucial. Combining 
this information in a likelihood function is one of the most simple, robust and straightforward means to build 
a multivariate discriminant between signal and background events. 

In order to discriminate a dark matter signal at the LHC, we study the process 

pp ^ z{^ £+r)xx , (5) 

where the DM y interacts with quarks via operators Dl, D5, D 8 , and D9 and directly with gauge bosons via 
operators L 5 and Ly. We also considered a process where dark matter is produced through the production and 
decay of a new leptophobic massive gauge boson Z' 

pp Z{->- t£-)Z'{-^ xx) , 


( 6 ) 
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TABLE I. Production cross sections (in fb) for pair production of DM in association with a Z boson and one extra QCD 
jet at LHC 13 TeV: pp —>■ Zxx + i- In Ih® Isft Pnrt of the table we display the rates for the EFT operators considered 
in this work for Ai = 1 TeV as a function of the DM mass. In the right part, the rates for the leptophobic Z' model are 


function of the Z' 

mass for 

9f = 

9x = 

1 and 

= 100 GeV. 




(GeV) 

D1 

D5 

D8 

D9 

Dim-5(xl0"'‘) 

Dim-7 

Mz’ (GeV) 

Z' 

< 10 

275 

47 

47 

1219 

2.5 

43 

500 

144 

100 

257 

46 

43 

1204 

1.2 

39 

1000 

21.1 

200 

218 

42 

34 

1058 

0.59 

31 

2000 

1.50 

400 

142 

32 

20 

782 

0.18 

19 

3000 

0.18 

1000 

26 

8 

3 

205 

0.08 

3 

4000 

0.03 


The DM effective operators and the Z' simplified model were implemented in FeynRules m- Signal and 
background samples for €+£“ +^t up to one extra jet are generated with Madgraph [20] followed by the parton 
showering and hadronization with Pythia [21j and the detector simulation using Delphes |22j . The hard 
(MadGraph) and soft jet (Pythia) regimes are matched according to the MLM [23 prescription at the matching 
scale Qcut = 20 GeV for the EFT operators and 40 GeV for the heavy Z' boson^ In Table (|^ we display the 
production cross section for each operator for various dark matter masses keeping A^ = 1 TeV for 13 TeV LHG 
and for the Z' production and decay to dark matter in association to a SM Z boson keeping gf = = 1 and 

= 100 GeV fixed. 

The ZZ ^ events are characterized by large and the presence of two high px isolated electrons or 
muons. The following preselection cuts were imposed, inspired by the ATLAS study of ZZ 0: 

• two same-flavor opposite-sign electrons or muons, each with > 20 GeV in \r]^\ < 2.5 and dilepton 

invariant mass close to the Z boson mass: € [76,106] GeV; Ai?« > 0.3 

• veto jets with > 25 GeV and \r]^\ <4.5. 

We also investigated an event selection without imposing the jet veto but keeping the px, V, and mu require¬ 
ments. 

At the 13 TeV LHC, the dominant SM backgrounds after the Z window selection consists of ZZ + j and 
it events, with a sub-dominant contribution from WW,Wt,WZ,T'^T~. The Wt and r+r” backgrounds are 
negligible after we demand a MET cut: 

• ^x> 100,150, • • • , 500 GeV 

We chose the following kinematic variables to better classify the signal and background events both in the 
EFT as well as the simplified Z' model frameworks: 

• Missing energy ^x is expected to be the kinematic variable that offers one of the best avenues of discerning 
signal and background. However, this variable is used as a cut variable in order to reduce the number of 
background events but not to build the likelihood functions. At the LHC, the transverse missing energy 
serves as a trigger to new physics and a cut on l^x is always necessary. In any case, ^x is correlated to 
other variables which we actually use to construct the multivariate discriminant of our signal models and 
the backgrounds. 

• The product ^x x cos ,Px)^ , where At/) is the angle between the two dimensional vector A™®® 

and the transverse momentum p^ of the Z boson candidate. This serves as a measure of axial-.^Tj which 
is defined as the projection of i?™®®® along the direction opposite to the Z candidate [Sj- This variable 
has a great potential to discern between the various DM operators as suggested in [3- 

• The fractional px difference \^x — PxMPt I2I- 

• The azimuthal separation of the two leptons ,£~). 


^ Although we do not use it in the curent work, we note that NLO cross sections for the signal have been computed at the parton 
level and implemented in MCFM by [24], and in POWHEG by |25| . 
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• ar = i?T(^ 2 )/AfT, where Et 2 is the transverse energy of the softest lepton of the £+£ pair and Mt = 
^J{ETl + Et 2 Y — {pxi +Px 2 Y — {Pyi + Py 2 Y■ This variable has been studied by [26] , 

• The angular variable cos(0*) ^7} where 9* is defined as the boost invariant tan(0*) = tanh ^ , jn 

the case of sparticles production and decay to short chains, as slepton to leptons + [ 22 ], or sbottoms 

to bottom jets + | 1 H|, this variable is shown to be correlated to the production angle of the sparticles. 

• The contransverse mass MTc = -y/2 {pTi ■ PTi + Pt^PTi ) P5] . 

• The angular variable cob{9m) 130], the cosine of the angle between the leptons’ direction of motion and 
the beam-axis in the centre of mass frame of the leptons. Compared to cos{9*), cos(0m) is not sensitive 
to end point effects observed as | cos( 6 >*)| ^ 1, which diminishes the discerning power of the 9* variable 
in this region. 

We now turn to a discussion of the normalized distributions of the above variables. In the upper left panel of 
Fig. 0 we show the ]^t distribution for the SM backgrounds (dashed lines) and EFT interactions (solid lines). 
The black, blue, red, and green solid lines show the distributions for Dl, D5, D9, and the dim-7 operators. 
Not surprisingly, vector (D5) and vector-axial couplings (D 8 ) (not shown in the figure but very similar to D5) 
have the closest resemblance to the ZZ background (black dashed). The other EFT interactions, in turn, show 
harder distributions. The other panels of Fig. 0 display those variables which are correlated to In the 
upper right panel, we show the variable ot, while in the bottom left and right panels we show the axial-.^T 
and contransverse mass MTc, respectively. In these figures, the SM background is shaded in orange, while the 
solid black and blue lines denote operators Dl and D5, respectively, while dotted black and blue lines denote 
D9 and the dim-7 operators, respectively. 

In the Fig. 0 we show the angular variables used in the construction of the multivariate discriminant after 
the selection cuts and a missing transverse cut I^t > 150 GeV. In the upper left panel we show the azimuthal 
angle between the leptons. This is more peaked to small values for EFT operators than the SM backgrounds, a 
consequence of harder missing transverse momentum of dark matter which makes the signal lepton pairs more 
collimated. In other words, the Z boson transverse momentum gets more correlated to the dark matter pair 
Pt as can be seen at the upper right panel where, again, the EFT interactions lead to smaller fractional pr 
difference \^t — Pt\/Pt compared to the backgrounds. For the cos(0*) and cos{9m) distributions we see 
basically the same effect of enhanced collimation of EFT interactions compared to the SM events. 

As a general remark we note that the vector effective interaction is, again, the most similar to the SM 
backgrounds (as well as D 8 ), while Dl, D9 and Dim-7 are similar to each other but less similar to the back¬ 
grounds. Of course, these features will have an impact on the discerning power of the discriminants for each 
EFT operator. 

Demanding harder cuts to further separate signal events from backgrounds has a deleterious effect on 
the discrimination power of the shapes of these distributions. As increases, the Z momentum gets more 
correlated to the momentum of the dark matter pair, and higher correlation translates into increased similarity 
between signal and backgrounds distributions. This can be seen in Fig. 0 where we show, from left to right, 
the impact of increasing the cut on the ,i~) variable. 


IV. SYSTEMATIC UNCERTAINTIES AND STATISTICAL SIGNIFICANCE 

Systematic uncertainties frequently limit the reach of experiments, mainly in processes with small signal to 
background ratio. Multivariate techniques can be used to overcome these limits, extracting more information 
and circumventing the constraints imposed by systematics in the statistical significance of a signal. A very good 
example is the single top measurement first observed by the CDF collaboration m where the combination of 
four multivariate techniques was necessary to extract the signal in spite of the small signal to background ratio 
and a large number of sources of systematic uncertainties. 

Incorporating systematic uncertainties in the computation of the statistical significance level of a signal can 
be done in many ways and the subject itself has been debated in the statistical community |32j . 

There are many sources of systematics in a real experiment and taking all of them into account in a simu¬ 
lation is difficult. However, in a discovery analysis, the loss of statistical significance is, roughly, an effect of 
the widening of the probability density distribution of the chosen test statistic associated to the background 
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FIG. 1. Upper panel: Plot of (left) and ar (right). Lower panel: Plot of x cos (A(^(p5^'‘’'*,py)) (left), and 
MTc (right). Legend - upper left plot: Operator D1 (black solid line), operator D5 (blue solid line), operator D9 (red 
solid line), operator ZZxx~^^^ 7 (green solid line). SM backgrounds are dotted, with ZZ —> Mvu denoted by the black 
dotted line. Other plots: Operator D1 (black solid line), operator D5 (blue solid line), operator D9 (dotted black line), 
operator Z.^xx^dim 7 (dotted blue line), SM background (shaded orange). 


hypothesis due to the presence of the systematic uncertainties. This can be more easily understood in terms of 
the naive signihcance metric given by 


g 

^ + (esysB)^ 

where S is the number of signal events, B the number of backgrounds events, and e^ysB the systematic 
uncertainty on the backgrounds in the limit of high statistics i.e. B ^ 1. 

It is easy to invert this relation to obtain the minimum integrated luminosity required to reach a given 
statistical signihcance Z^t- If (Js and ctb are the signal and backgrounds cross sections, respectively, it can be 
shown that (tsIcpb > Zsb£sys is a constraint on the achievable signihcance. If systematics are too large, a 5cr 
discovery, for example, is not possible no matter how much data has been accumulated. 

The formula Q given above takes into account only the total number of signal and background events and 
increasing the signal to background ratio can be done only in a cut-and-count analysis. This is our approach 
in this work, constructing a distribution where signal and backgrounds events are well separated and imposing 
a cut to clean up the signal events more efficiently than the distribution. 

Systematic uncertainties in the shape of the distributions used in the multivariate analysis might also be 
important, but that is beyond the scope of our work. We note that the results obtained in our more simple 
approach to systematics are sufficient to show that taking these errors into account is crucial to reliably estimate 
the reach of the experiment. 

For the purpose of calculating the signal significance in the presence of systematic uncertainties in the number 
of backgrounds events we use, instead of Eq. Q which is shown to overestimate the significance for Ssys < 1 [32] , 
the metric proposed in [33) . Zp^, which is simple to use and is shown to be an excellent approximation to the 
consistent frequentist computation using Poisson distributions. Moreover, we found that among the three more 
reliable methods to compute the statistical significance incorporating systematic uncertainties, Zpp is the most 
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FIG. 2. Upper panel: Plot of ) (left) and \]^t — pfi/pf (right). Lower panel: cos(0*) (left) and cos(0m) 

(right). Legend: Operator D1 (black solid line), operator D5 (blue solid line), operator D9 (dotted black line), operator 
^■^XX"dim 7 (dotted blue line), SM background (shaded orange). 


conservative one. More details on these metrics are given in Appendix A. 





Ar^cc.r) 

FIG. 3. From left to right, the impact of increasing the cut (]^t > 100, 150, and 300 GeV, respectively) on the 
A(()(^+,£“) variable. 

The background estimation and signal extraction strategies would depend on upgraded detector designs 
and trigger conditions, especially in high pileup conditions. However, based on current data, we can give a 
very simple estimation of the systematic uncertainties on the ZZ —>■ tivv background. In [^, the ATLAS 
collaboration studied the process ZZ Uvv at 7 TeV with 4.6 fb“^ of data, with an event count of 39.3 ± 
0.4 ± 3.7, where the first and second uncertainties are statistical and systematic, respectively. The systematic 
uncertainties include contributions from uncertainties in the lepton reconstruction efhciency and the lepton 
energy scale and resolution, the missing transverse energy modeling, the jet veto uncertainty and uncertainties 
in the trigger efficiency, PDF and scale, and generator modeling and parton shower. In a recent study from the 
CMS collaboration, [M], the cross section for the process pp ZZ —>• Uvu was given as 88^\l{stat)'^l'^{syst) fb 
at 8 TeV with Z bosons in the mass range 60 to 120 GeV. From the above estimates, it is clear that systematic 
uncertainties ~ 0(10%) are possible. 




















































































The other SM background that is important besides ZZ + j includes contributions from tt, WW,WZ. How¬ 
ever, as we are going to show, after applying the MVA and jet veto selections they become negligible and the 
only relevant sources of systematic errors are those associated to the ZZ backgrounds as we described here. 


V. MULTIVARIATE ANALYSIS AND RESULTS 


One robust method for combining several sensitive variables is the multivariate likelihood function [SI1I3S1- 
a Bayesian learning classifier |38j . The relative probabilities of finding an event in histograms of each input 
variable described above is used, comparing between signal and background. 

Binned probability density functions for each input variable are used to construct the likelihood function 
Vk, where k denotes the event class (fc = 1 for signal, k = 2,3,4, 5 for ZZ, WW, ZW and ti backgrounds, 
respectively). We denote by fijk the probability that an event from the sample k will populate the bin j of the 
kinematic variable i. The probabilities are normalized to 




for every variable i and event sample k. The likelihood function is computed as follows. For each reconstructed 
variable i, the bin j in which the event falls is obtained. The quantities 


Pik — 


fijk 


Sm=l /b 


(9) 


are computed for every variable i and all values of k. These quantities pik are finally used to calculate the 
likelihood function 




n '^var 

i—l Pik 


Y\i=T Pir. 


( 10 ) 


where rivar is the number of input variables. The quantity Dlf = corresponding to the signal likelihood 
function, is referred to as the likelihood discriminant. 

We note that the likelihood function does not use correlations between the variables, although since the 
distributions are obtained from the fully simulated Monte Carlo data, any correlations are included in the 
modeling. More sophisticated analyses take advantage of the correlations between the variables. 

In Fig. (j^, we display the likelihood function Vlf distributions for operators Dl, D5 (D8 is very similar to 
D5), D9, dim-5, and dim-7 after cuts on have been applied. The blue line shows the distribution for signal, 
while the red line shows the distribution for background. It is clear that the signal peaks near Vle = 1, while 
the background peaks near = 0. Nevertheless, we observe smaller signal peaks in the background region, 
where the likelihood discriminant classified signal events as background events, and vice-versa. This can be 
understood if one keeps in mind that leptons from both the signal and the dominant background ZZ are the 
yields of a Z boson decay, and thus it is natural that a fraction of signal events are misidentified as background 
events. The most distinctive features of the kinematic variables are driven by the dark matter interactions and 
the identity of the source of missing energy - the neutrinos in the SM case, and the x DM in the case of new 
physics parametrized by the EFT operators. 

A harder cut results in EFT signal and background distributions for the likelihood function that look 
similar. This happens due to the fact that the signal and background shapes of the kinematic distributions 
start to become similar as the cut is increased as we have shown in Fig. To obtain optimal reach, one 
thus has to balance between the cut and the cut on Vlf- The cut is chosen to be > 100, 150, 300 
and 450 GeV. It is clear that increasing the cut gradually makes the signal and background distributions 
of 'Dff similar, reducing the efficiency of the 'Dff cut. The same behavior concerning cuts were observed 
in the case of Z' events. 

At the lower right panels of Fig. Q we show the likelihood distributions for the Z' model for Mz' = 0.5, 1, 2 
and 3 TeV fixing > 100 GeV. In this case, V^f performs better for heavier Z'. This is expected, since 
a lighter Z' tends to look like a SM Z boson, that is, the ZZ' process become less distinguishable from the 
ZZ, in spite of the fact that is due a heavy DM in the signal case. Compared to EFT operators, the Z' 
discriminant distributions discern better signal from backgrounds. 

In the next Section, we go on to give our main results for the EFT operators. 
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FIG. 4. The likelihood function Hlf is displayed for operators Dl, D5, D9, dim-5, dim-7, and the Z' model. The blue 
line shows the distribution for signal, while the red line shows the distribution for background. The EFT plots show the 
effect of the cut on the discerning power of the likelihood discriminant, while the Z' plots, at the lower right panels, 
show that heavier Z' are easier to separate from the SM backgrounds. For Z' we fixed > 100 GeV. 


VI. RESULTS FOR EFT OPERATORS 

As described previously, the reach for A depends on a judicious choice of cut and a cut on the likelihood 
function We display the discovery reach (5cr) for a variety of luminosities and choices of systematics. 

The results for the operator Dl, with a jeto veto, are shown in the upper row of Fig. The reach for the 
cut-off scale A is shown for various choices of cuts on (vertical axis) and ]^t (horizontal axis), for different 
luminosities and systematics. The reach for luminosity 300 fb“^, for systematics of 5% (20%), is 1857 (1500) 
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MET cut[GeVJ 


Dl, L=3000fb-', esys=20% 



100 200 300 400 500 

MET cut[GeV] 

D5, L=3000fb-*, s,y,=20% 



100 200 300 400 500 


MET cut[GeV] 

=l.TeV ,gf=g^=l ,L=3.ab‘', 20% 



100 200 300 400 500 


MET cut[GeV] 


FIG. 5. The reach(5(j) for the cut-off scale A is shown at the upper and middle rows for various choices of cuts on "Dle 
(vertical axis) and (horizontal axis), for 3 ab“^ of integrated luminosity. At the upper row we show the results for 
the Dl operator, at the middle row the D5 operator. At the lower row, the statistical significance for the discovery of 
an 1 TeV Z' with couplings gf — = 1 is shown. At the left(right) panels the systematic uncertainty is 5(20)%. 
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»i;j.=100GeV, MET>100GeV, MVA>0.9 



FIG. 6. Reach in A for EFT operators: The reach in A for the EFT operators at 3000 of integrated luminosity. 
The red curves display the case when jet veto is applied, while the black curves display the case when no jet veto is 
applied. The mass of the DM candidate is fixed at 100 GeV. The cuts applied are > 100 GeV and Huf > 0.9. 


GeV. The reach for luminosity 3000 fb“^, for systematics of 5% (20%), is 2209 (1542) GeV. As is clear from 
the hgure, the maximum reach is obtained for > 100 GeV, T>lf > 0.9. We observed very similar behavior 
concerning operators D9, dim-5 and dim-7. The reach for the dim-5 operator is limited to a few tens of GeV 
even for low systematics and high luminosities. We summarize in Table ( [TT| ) the results for all operators but 
dim-5. 

The results for the operator D5 are shown in the middle row of Fig.[^ The results for operator D8 are similar. 
The reach for luminosity 300 fb“^, for systematics of 5% (20%), is 843 (695) GeV. The reach for luminosity 
3000 fb“^, for systematics of 5% (20%), is 1021 (720) GeV. In this case, however, we see that a harder 
cut degrades the achievable reach of the LHC compared to the other operators. This is, again, due the larger 
similarity between the shapes of the kinematic distributions of D5 (and D8) operator and the backgrounds as 
can be seen in Figs. 0, 0 and 0. 

The final results at 3000 fb“^ for the reach in A for all the EFT operators are displayed in Fig. The reach 
in A for the EFT operators is shown at 3000 fb“^ of integrated luminosity. The red curves display the case 
when a jet veto is applied, while the black curves display the case when no jet veto is required. The mass of 
the DM candidate is fixed at 100 GeV throughout all our analyses. The cuts applied are > 100 GeV and 
Vlf > 0.9. The solid, long-dashed, short-dashed, and dot-dashed curves denote the limits for D9, Dl, D5, 
and the dim-7 operators, respectively. The D8 curves are very close to the D5 ones. 

First of all, we see that including systematic uncertainties is a really necessary ingredient to get realistic 
estimates for the reach of the LHC for dark matter. Keeping systematics under control might improve the 
reach discovery by a factor of two roughly. For example, in the D9 case, a 20% systematics in the backgrounds 
reduces the reach to ~ 2200 GeV, while keeping a 5% level or less increases the reach to ^ 3500 GeV or farthest. 

Adopting a jet veto also helps to get a larger reach for dark matter searches in the mono-Z channel if the 
systematics are larger than approximately 3-5%. 
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TABLE II. Results for EFT operators and the leptophobic dark Z' portal for three different selection cuts, integrated 
luminosities of 0.3 and 3 ab“^, and systematic uncertainties of 5% and 20% in the total background rate. The last row 
shows the effect of including a jet veto in the analysis. The blank entries in the last column indicate that a 5a discovery 
cannot be reached for that luminosity and level of systematics. 


CUTS 

L(ab-i) 

sys(%) 

Dl 

D5 

D8 

D9 

Dim-7 

Mz' 

>100 GeV 

0.3 

5 

702 

404 

398 

994 

400 

327 

20 

470 

270 

266 

665 

268 

- 


3 

5 
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A. Comparative Performances of MVA and MET Cut for EFT Operators 



FIG. 7. In the left panel, the MVA discriminant distribution for the D1 operator (solid line) and all the relevant 
backgrounds (dashed lines) for a mono-Z search. In the central(right) panel we display the cut efficiency with just a 
MET cut of 100 GeV with(without) a jet veto for all the EFT operators, but dim-5, and for ZZ and top pair backgrounds. 


The lesser sensitivity to systematics is the result of higher S/B ratios when we better classify the events 
according to the MVA discriminant. We checked that the signal to background ratio increases of an order of 
magnitude after imposing > 0.8 compared to the MET cut analysis alone, for all MET cuts, reaching 
S/B ^ 7 for a 100 GeV MET cut and Vpp > 0.9 with and without a jet veto. 

In the 13 TeV LHC, top quark pair production is not as harmless as in 7 or 8 TeV runs concerning the mono- 
Z channel. In this respect the combination of a MVA discriminant and a jet veto can be used to eliminate 
background events not involving a SM Z boson. This can be seen in Fig. Q. First, in the left panel, note that 
the MVA distributions of tt and WW backgrounds are highly concentrated near = 0, while those ZZ and 
ZW are more widely distributed. Demanding Tipp > 0.5 gets rid of all top events and the majority of WW 
backgrounds. In the central(right) panel we show the cut efficiency for a MET cut only with(without) the jet 
veto for DI, D5, D9, and dim-7 operators, and the ZZ and top pair backgrounds. Again, the top backgrounds 
can be more efficiently eliminated by tagging the hard bottom jets from top decays. 

Let us discuss the advantage of using the multivariate analysis that we have described, when compared to 
a simple cut. In Fig. we show the ratio of the reach in A when the cut is combined with MVA, 
compared to when only the ^p cut is applied. With no systematics, the reach in A with MVA is approximately 
1.2 — 1.4 times larger than the case with only ^p cut. This ratio increases dramatically with increasing 
systematics, reaching 2.0 — 2.4 for ~ 5% systematics, after which it becomes stable for a wide range of 
integrated luminosities from 300 to 3000 fb“^. This clearly shows the advantage of using the MVA for higher 
systematics. 




























































13 


D1 ,m^=100GeV D5 , m^=100GeV 




FIG. 8. The ratio of the reach in A, as a function of systematics, when the cut is combined with MVA, compared to 
when only the ]^t cut is applied, for Dl, D5, D9, and dim-7(MD7) operators. The dashed(solid) lines show the ratios 
when we do(not) apply a jet veto. The lower red curves were obtained fixing the luminsoity at 300 fb“^, the central 
blue ones at 1000 fb“'^, and the upper black curves at 3000 fb“^. 


VII. RESULTS FOR THE LEPTOPHOBIC DARK Z' PORTAL MODEL 


Applying the multivariate analysis to the same kinematic observables in this case, we obtain the likelihood 
function for different masses of Z'. We will take gj = = 1 and = 100 GeV in what follows. 

In the last row of Fig.([^, the significance is shown for the mediator mass mz' = 1 TeV, for an integrated 
luminosity of 3 ab“^ and 5 and 20% systematics. 

In Fig. the 5cr reach in m'^ for > 100 GeV cut only and cut combined with different cuts on the 
likelihood function are shown, at 3 ab“^ of integrated luminosity. It is clear from the results that for 
no systematics, the reach with MVA is around 2 TeV, while the reach with cut is around 1.5 TeV. For 
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increasing systematics, there is rapid degeneration of this reach for the case when only cut is used. For the 
MVA analysis, the degeneration is much more gradual, and even for high systematics, the reach is still above 
1 TeV. 

Contrary to the case of EFT operators, imposing no jet veto on the events is advantageous for all systematics 
if we combine MVA and MET cuts. However, for a MET cut alone, the reach is much more depleted in 
comparison to the jet veto case. A ha signal cannot be reached even for 3 ab“^ if Ssys > 0.12 as we see in 
the left panel of Fig. 0- We conclude that a jet veto removes more signal and background events, but with a 
higher signal to background ratio. 

As in the case of EFT models, the sensitivity to systematics is greatly reduced as a result of higher S/B ratios 
when we better classify the events according to the MVA discriminant. We also checked, for these Z' models, 
that the signal to background ratio increases of an order of magnitude after imposing > 0.8 compared to 
the MET cut analysis alone without a jet veto. 


no jet veto jet veto 



FIG. 9. Reach in Mz/: The 5a reach in M'z for pT > 100 GeV cut only (red curve) and ]^t cut combined with 
different cuts on the likelihood function Hlf: 0.9 (black solid), 0.8 (black dashed), and 0.5 (black dotted), at 3000 fb“^ 
of integrated luminosity. The dark matter mass is fixed at 100 GeV, and all couplings are set to one. At the left (right) 
panel we show the results without(with) a jet veto. 


VIII. CONCLUSIONS 

In this paper, we have investigated the discovery prospects of DM effective operators and simplified models 
with vector mediators at Run II of the LHC. We have paid particular attention to the question of systematic 
uncertainties. To that end, we have based our study on a multivariate analysis in the mono-Z channel. 

Our main results are the following. We have seen that the reach obtained using a multivariate discriminant 
is typically at least twice as high as that obtained from a simple cut and count analysis, once systematic 
uncertainties in the total background normalization higher than a few percent are taken into account. This is 
clear from Fig.|^ With no systematics, the reach in A with the multivariate analysis is approximately 1.2 — 1.4 
times larger than the case with only cut. This ratio increases dramatically with increasing systematics, 
reaching 2.0 — 2.4 for ^ 5% systematics, after which it becomes stable. This clearly shows the advantage of 
using the MVA for higher systematics. 

Moreover, the reach is much more stable against degradation due to systematics, both for the EFT analysis 
as well as for the Z' analysis. This is clear from Fig. where we show the results of the Z' analysis. For 
no systematics, the reach in mz' with multivariate analysis is around 2 TeV, while the reach with only 
cut is around 1.5 TeV. For increasing systematics, there is rapid degeneration of the reach for the case when 
only cut is used. For the multivariate analysis, the degeneration is much more gradual, and even for high 
systematics, the reach is still above 1 TeV. 

We also investigated the impact of including a jet veto to suppress backgrounds further. We found that a jet 
veto improves the reaches for the EFT as long as the level of systematics exceedes 3-5%. For the leptophobic 
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Z' simplified model, requiring no jet veto provides better results when we perform the multivariate analysis, 
but it is important if just a MET cut is used. 

Our reaches are the following. For DM-quark effective operators with scalar, vector, and tensor couplings 
and DM mass 100 GeV, the Scr reaches in the DM interaction scale A keeping the level of systematics at 5% 
with 300 (3000) fb“^ of integrated luminosity are 1864 (2209) GeV, 959 (1021) GeV, and 2686 (3178) GeV, 
respectively. For simplified models with leptophobic vector mediators, the 5a reach for the mass of the mediator 
is 1233 (1485) GeV with 5% systematics and 300 (3000) fb“^ of data. 
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Appendix A: Details of the significance computation 


A comprehensive study of various methods for the calculation of statistical significances has been presented 
in [32] . In this work, the problem of incorporating systematic uncertainties in the background normalization 
for a Poisson process is addressed and it is found that three most widely used significance metrics perform 
similarly in many situations concerning the relative number of signal and background events and the level of 
systematics. 

The three metrics are computed as follows: 

Zst, This is the naive and most simple way to incorporate systematic uncertainties in the calculation of the 
significances for S signal events and B background events in a Poisson process for a given integrated 
luminosity 




sb — 


s 


In all cases, we assume that the systematic uncertainty in the total background normalization is propor¬ 
tional to the number of background events, a^yg = SgysB. This is simple and fast, but as in [32] . we found 
it somewhat overestimates the reach as we show in Fig. (101, the upper blue lines. 


Zn The the Bayesian-frequentist hybrid recipe to the estimation of the systematics impact on the significance. 
Assuming that systematic errors are normally distributed we marginalize over the systematic errors to 
obtain the p-value 


P = 


+ 00 


k=S+B 


1' + °° B(l-|-2eays) 


k\ 


[B{1 + Z£sys)t X ® 




dz 


and the significance is computed as Zjv = 4> ^(1 — p), where $(z) is the cumulative distribution function 
of the standard normal distribution. 


This method demands a much larger computation effort. As we show in Fig. (10), the middle red lines, its 


performance is almost identical to the naive metric though, despite the reach in A is always shorter 
than Zsb- In the right panel we see that in the regime of a low number of events, Z^v might present 
numerical instabilities. 
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ZpL The Profile Likelihood method originally proposed in |33] in astrophysical searches with subsidiary mea¬ 
surements of the background adapted to a high energy experiment where the systematics is a fraction of 
background events 


ZpL = 


{S + B)\n 



S + B 

S + B + Byal,_ 




B^la 


2 

sys 


S + B + Bya%, 
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Among the three metrics this is the most conservative and reliable, and it is as simple and fast to compute 
as Zsb [22] ■ Moreover, its performance is very close to the consistent frequentist approach for tests of the 
ratio of Poisson means implemented in ROOT [33], for example. The Zpp results correspond to the lower 
black lines in Fig. (10). 


In order to choose judiciously among these three significances metrics, a performance comparison for our 
benchmark point in the case of determining the LHC reach for the scalar operator D1 was carried out. In 
the Fig. (10) we show the reach in A using Zsb (upper blue), Zpi (middle red), and Zpp (lower black) in the 
multivariate analysis (dashed lines) and the simple cut-and-count analysis (solid lines). In the left(right) panels 
we assume a 100(1000) fb“^ of integrated luminosity for V^p > 0.9 and a MET cut of 100 GeV and 450 GeV. 

The Profile Likelihood metric Zpp is the most conservative one, thus confirming the findings of [32]. The 
reaches, however, are similar, especially comparing Zsb and Zjv in the regime of a higher number of signal 
and background events. As the number of events decrease, as in the case of a hard MET cut of 450 GeV, the 
overestimation of Zsb and Zp[ gets more noticeable. Also, numerical instabilities appear in the computation of 
Zf^ and a greater care is necessary when using this method. 

Using the multivariate discriminant to better separate signal and backgrounds increases the S/B ratio com¬ 
pared to a simple MET cut. In this regime (dashed lines), Zsb and also overestimate the reach compared to 
ZpL, but the performances are very close with a MET cut only (solid lines), that is it, for smaller S/B ratios. 


D, ,mj.=l(IO(;eV 





FIG. 10. Comparison of significance calculation methods. The solid(dashed) lines represent the reach in A for the scalar 
D1 operator for = 100 GeV with MET(MET-I-MVA) cuts as a function of the systematic uncertainties Ssys for 100 
and 1000 fb“^. The upper, middle and lower dashed(solid) lines correspond to Zsb, Zn, and Zpl metrics respectively. 
For solid lines we apply only a 100 GeV or 450 GeV MET cut, while for the dashed ones we also impose the MVA cut 
Ble > 0.9. 
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